00:00
2026-05-07
machinelearning.apple.com
computer-vision
Text-Conditional JEPA for Learning Semantically Rich Visual Representations
Researchers Chen Huang, Xianhang Li, Vimal Thilak, Etai Littwin, and Josh Susskind have developed Text-Conditional JEPA (TC-JEPA), a visual self-supervised learning model that uses image captions to rโฆ